
On continuous space word representations as input of LSTM language model


Abstract

Artificial neural networks have become the state of the art in the task of language modelling, and Long Short-Term Memory (LSTM) networks in particular seem to be an efficient architecture. The continuous skip-gram and continuous bag-of-words (CBOW) algorithms learn high-quality distributed vector representations that are able to capture a large number of syntactic and semantic word relationships. In this paper, we carry out experiments with a combination of these powerful models: continuous word representations trained with the skip-gram, CBOW, or GloVe method, together with a word cache expressed as a vector using latent Dirichlet allocation (LDA). These are all used at the input of the LSTM network instead of the 1-of-N coding traditionally used in language models. The proposed models are tested on the Penn Treebank and MALACH corpora.
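The core idea of the abstract — replacing sparse 1-of-N input coding with dense pretrained word vectors — can be illustrated with a minimal sketch. The vocabulary, embedding dimension, and random matrix `E` below are hypothetical stand-ins for vectors trained with skip-gram, CBOW, or GloVe; the point is only that a dense D-dimensional lookup replaces a sparse V-dimensional one-hot vector as the network input.

```python
import numpy as np

vocab = ["the", "cat", "sat"]
V = len(vocab)   # vocabulary size (1-of-N input dimension)
D = 4            # embedding dimension (hypothetical)

# Stand-in for pretrained skip-gram/CBOW/GloVe vectors, one row per word.
rng = np.random.default_rng(0)
E = rng.standard_normal((V, D))

def one_hot(idx, size):
    """Traditional 1-of-N coding: a sparse, V-dimensional input vector."""
    v = np.zeros(size)
    v[idx] = 1.0
    return v

def embed(idx):
    """Continuous-space input: a dense, D-dimensional vector (row of E)."""
    return E[idx]

i = vocab.index("cat")
x_onehot = one_hot(i, V)  # shape (V,) -- fed to a classic language model
x_dense = embed(i)        # shape (D,) -- fed to the LSTM input instead

# The dense lookup is exactly the one-hot vector projected through E,
# so swapping the input coding is equivalent to fixing the first layer
# to the pretrained embedding matrix.
assert np.allclose(x_dense, x_onehot @ E)
```

In a full model, each `x_dense` (optionally concatenated with an LDA-based cache vector for the recent word history) would be the per-timestep input to the LSTM, rather than a trainable projection of the one-hot code.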

